Near-optimal Anomaly Detection in Graphs using Lovasz Extended Scan Statistic
نویسندگان
چکیده
The detection of anomalous activity in graphs is a statistical problem that arises in many applications, such as network surveillance, disease outbreak detection, and activity monitoring in social networks. Beyond its wide applicability, graph structured anomaly detection serves as a case study in the difficulty of balancing computational complexity with statistical power. In this work, we develop from first principles the generalized likelihood ratio test for determining if there is a well connected region of activation over the vertices in the graph in Gaussian noise. Because this test is computationally infeasible, we provide a relaxation, called the Lovász extended scan statistic (LESS) that uses submodularity to approximate the intractable generalized likelihood ratio. We demonstrate a connection between LESS and maximum a-posteriori inference in Markov random fields, which provides us with a poly-time algorithm for LESS. Using electrical network theory, we are able to control type 1 error for LESS and prove conditions under which LESS is risk consistent. Finally, we consider specific graph models, the torus, knearest neighbor graphs, and ǫ-random graphs. We show that on these graphs our results provide near-optimal performance by matching our results to known lower bounds.
منابع مشابه
A Power-Enhanced Algorithm for Spatial Anomaly Detection in Binary Labelled Point Data Using the Spatial Scan Statistic
This paper presents a novel modification to an existing algorithm for spatial anomaly detection in binary labeled point data sets, using the Bernoulli version of the Spatial Scan Statistic. We identify a potential ambiguity in p-values produced by Monte Carlo testing, which (by the selection of the most conservative p-value) can lead to sub-optimal power. When such ambiguity occurs, the modific...
متن کاملAnomaly Graphs and Champions
A scan statistic methodology for detecting anomalies has been developed for application to graphs. We equate anomalies with vertices that exhibit high local connectivity properties. In particular we look for cases where all vertices have similar local connectivity, except for one vertex (a champion) that has much higher connectivity at a certain level. For example, a neighborhood champion is a ...
متن کاملOn the anomalous behaviour of a class of locality statistics
A scan statistic methodology for detecting anomalies has been developed for application to graphs, where “anomalies” are equated with vertices that exhibit distinctive local connectivity properties. We present an “anomaly graph” construction that illustrates the capabilities of these scan statistics via the behaviour of their associated locality statistics on our anomaly graphs. © 2007 Elsevier...
متن کاملCharacterization of L-norm Statistic for Anomaly Detection in Erdős Rényi Graphs
We devise statistical tests to detect the presence of an embedded ErdősRényi (ER) subgraph inside a random graph, which is also an ER graph. We make use of properties of the asymptotic distribution of eigenvectors of random graphs to detect the subgraph. This problem is related to the planted clique problem that is of considerable interest.
متن کاملScan Statistics on Enron Graphs
We introduce a theory of scan statistics on graphs and apply the ideas to the problem of anomaly detection in a time series of Enron email graphs. Corresponding author: Carey E. Priebe =
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013